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1.  SUMMARY 

The  psychophysical  method  of  limits  was  used  to  measure 
the  distance  at  which  observers  could  distinguish  military 
vehicles  photographed  in  natural  landscapes.  Obtained  from 
the  TNO-TM  Search_2  dataset,  these  pictures  either  were 
rear-projected  35  mm  slides  or  were  presented  on  a computer 
monitor.  Based  on  the  rational  that  more  difficult  vehicle 
targets  would  require  more  visual  pathways  for  recognition, 
difficulty  of  acquisition  was  defined  in  terms  of  the  relative 
retinal  area  required  for  recognition.  Relative  retinal  area 
was  derived  from  the  inverse  square  of  the  recognition 
distance  of  a particular  vehicle  relative  to  the  distance  of  the 
vehicle  that  could  be  seen  furthest  away.  Results  are 
compared  with  data  on  the  time  required  to  find  the  vehicles 
in  these  pictures,  These  comparisons  indicate  that  1)  the  two 
methods  are  complementary  with  respect  to  distinguishing 
different  degrees  of  acquisition  difficulty;  2)  recognition 
distance  thresholds  can  be  a suitable  means  of  defining 
standards  for  the  effectiveness  of  vital  graphic  information. 

Keywords:  vision,  graphics,  recognition,  distance,  retinal 
area,  measurement,  standards,  target  acquisition,  TNO-TM 
Search_2 

2.  INTRODUCTION 
2.1.  Background 

Graphic  designers  often  contend  that  their  work  is  too 
complex  to  be  adequately  represented  by  quantitative 
measurements  of  effectiveness.  Yet  the_ effectiveness  of 
camouflage  on  military  vehicles  or  the  legibility  of  warnings 
on  medications  are  examples  where  the  effectiveness  of 
graphic  design  has  life  and  death  consequences. 

The  power  of  the  printed  word  led  to  a long  history  of 
quantitative  research  based  on  measurement  of  reading  speed 
which  has  resulted  in  standards  for  the  legibility  of  black 
letters  on  white  backgrounds.’’  ^ In  an  early  attempt  to  extend 
such  work  to  include  color,  Paterson  &Tinker  (1931) 
measured  reading  speed  for  words  printed  in  various  colors 
on  various  colored  backgrounds.^  They  found  that 
black/white  were  the  most  effective  combination.  A year 
later  using  the  same  colors,  Preston,  Schwankl  & Tinker 
measured  the  effectiveness  of  colored  print  in  terms  of 
reading  distance.'’  Blue/white,  black/yellow,  and  green/white 
letter/background  combinations  were  found  to  be  the  most 
effective.  However,  their  results  received  little  attention 
because  subsequent  replications  using  reading  speed  and 

recognition  time  continued  to  find  black/white  to  be  best 
while  recognition  time  for  other  color  combinations  varied 


proportionally  to  lightness  contrast, This  was  seen  as 
being  consistent  with  vision  theory  that  the  middle  and  long 
wavelength  based  lightness-contrast  mechanisms  were  the 
primary  pathways  for  image  detail. 

Since  color  seemed  to  have  no  quantitative  effect  on 
legibility,  its  role  in  graphic  design  was  presumed  to  be 
wholly  a matter  of  aesthetic  judgement.  The  results  of 
Preston,  et  al,  were  conveniently  forgotten.  More  recent 
contrary  data  on  subjective  legibility  were  ignored,’  Also 
overlooked  were  data  indicating  different  time  constants  of 
visual  color  mechanisms.®'^  Convenience  was  probably 
another  reason  why  most  studies  after  the  1930's  evaluated 
legibility  using  time  measurements  rather  than  distance. 

Yet  tachistoscopic  presentations  are  not  representative  of 
reading  tasks  in  the  market  place.  Asked  to  measure  the 
legibility  of  health  warnings  printed  in  color  on  tobacco 
packages,  Nilsson  & Percival  reasoned  that  measuring 
legibility  in  terms  of  distance  made  more  sense  from  a 
consumer's  perspective.'®  However,  greater  distance  did  not 
adequately  reflect  subjectively  greater  ease  of  reading. 
Legibility  was  better  described  in  terms  of  the  required  retinal 
image  area  based  on  the  inverse  square-root  of  distance. 
Subsequent  research  using  distance  to  measure  the 
effectiveness  of  foreground/background  combinations  of  the 
six  primary  colors  in  messages,  symbols,  and  outline 
drawings  indicated  that  chromaticity  contributed  substantially 
to  effectiveness, ” This  effect  was  not  revealed  when 
effectiveness  was  measured  in  terms  of  reading  speed 
because  chromaticity  pathways  are  considerably  slower  than 
lightness-contrast  pathways. 

The  effectiveness  of  camouflage  depends  on  both  color  and 
pattern  perception.  Therefore,  search  time  may  not 
adequately  reflect  the  contribution  of  color  in  recognizing 
such  targets.  The  availability  of  the  TNO-TM  Search_2 
dataset  of  high  resolution  images  together  with  data  on  search 
time  proved  an  opportunity  to  compare  distance  with  time 
based  measurements  of  visual  effectiveness.’^  To  help 
develop  quantitative  standards  for  more  effective  graphic 
design,  this  study  evaluates  data  obtained  by  both  methods. 

2.2.  Retinal  Difficulty 

The  relationship  between  the  distance  at  which  a target  can 
be  recognized  and  the  target's  visual  effectiveness  is  not  as 
simple  as  might  be  supposed.  When  attention  is  directed  at  a 
target  in  a scene,  the  target's  image  falls  on  the  foveal  portion 
of  the  retina.  Since  the  fovea  has  about  one  afferent  neuron 
for  every  photoreceptor,  the  area  of  the  target's  image  is 
proportional  to  the  number  of  visual  pathways  available  to 
convey  information  about  the  target.  If  all  targets  at 
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recognition  threshold  produce  the  same  critical  amount  of 
Information  in  the  afferent  pathways,  a target’s  visual 
effeetivencss  or  difficulty  can  be  measured  in  terms  of  the 
retinal  area  needed  for  recognition.  The  retinal  area  required 
for  recognition  in  turn  depends  on  the  amount  of  information 
per  unit  area  of  a target's  image.  We'll  define  a visually 
effective  target  as  one  that  provides  enough  information  for 
recognition  in  a small  retinal  area.  Conversely,  a difficult 
target  is  one  that  provides  enough  information  only  when  its 
retinal  area  is  large.  At  recognition  threshold,  actual  retinal 
areas  need  not  be  calculated  to  compare  the  targets  in  terms 
of  how  much  information  they  provide.  The  ratio  of  their 
effectiveness  or  difficulty  depends  only  on  the  ratio  of  their 
threshold  retinal  areas. 

The  area  of  a target's  retinal  image  is  proportional  to  the 
target's  size  and  is  inversely  proportional  to  the  square  of  its 
distance.  Hold  target  size  constant  for  the  moment.  A 
measurement  of  the  maximal  distance  at  which  a target  can 
be  recognized  is  inversely  proportional  to  the  retinal  area 
needed  for  recognition.  A long  threshold  distance  means  a 
small  retinal  area  and  therefore  represents  a visually  effective 
target.  Conversely  a short  threshold  di.stancc  must  represent 
a visually  difficult  target.  The  ratio  of  their  effectiveness  or 
difficulty  can  therefore  be  determined  by  the  ratio  of  their 
threshold  distances  squared.  Since  the  prc.scnt  research 
concerned  measuring  the  effectiveness  of  camouflage,  a ratio 
that  reflects  difficulty  of  recognition  was  used.  The  target 
that  was  recognized  furthest  away  was  taken  as  the  standard. 
Its  small  retinal  area  was  set  to  a unit  value  " I ".  The  retinal 
areas  of  all  other  targets  at  threshold  were  .sealed  as  multiples 
of  this  unit  value  and  the  result  for  each  called  its  retinal 
difficulty. 

In  practice,  these  calculations  were  easy.  Due  to  the  invcr.se 
relationship  between  distance  and  area,  retinal  difficulty  was 
obtained  by  dividing  the  threshold  distance  squared  of  each 
target  into  the  threshold  distance  squared  of  the  target  that 
was  recognized  furthest.  As  an  example,  assume  that  a 
certain  difficult  target,  X,  had  a threshold  distance  of  2 
meters  and  that  the  least  difficult  target.  Y.  had  a threshold  of 
4 meters.  How  much  larger  Is  the  retinal  area  of  X compared 
to  Y?  The  retinal  area  of  X is  proportional  to  1 / 2";  the 
retinal  area  of  Y is  proportional  to  1 / 4".  In  finding  the  ratio 
of  these  proportions,  their  proportional-to-actiial-retinal- 
area  aspects  cancel,  and  the  result  directly  equals  the  ratio  of 
their  retinal  areas.  Representing  retinal  difficulty  of  target  X 
as  Rx,  the  proportional  retinal  area  of  targets  X and  Y as  Ax 
and  Ay,  and  their  threshold  di.stanccs  as  Dx  and  Dy,  we  have: 

Rx  = A, /Ay 
= (l/Dx)'  / (I/Dy)- 
= Dy^  / Dx^  = 4^  / 2-  = 4 

The  more  difficult  target  requires  4 times  the  area  at 
recognition  threshold  than  the  least  difficult  target. 
Accordingly  X's  retinal  difficulty  equals  the  value  "4" 
compared  to  target  Y. 

What  happens  when  targets  differ  in  size?  The  answer 
involves  the  concepts  discussed  so  far,  but  also  requires  some 
additional  concepts  including:  visage  - target  size  with 
respect  to  a plane  perpendicular  to  the  viewer,  retina! 
information  density  - amount  of  information  per  unit  area  of 
the  target's  retinal  image,  and  usable  information  density  - a 
quantity  which  reflects  the  limit  imposed  by  visual  acuity. 


Taken  into  consideration.  thc\'  explain  such  obvious  matters 
as  why  a large  image  can  be  visually  effective  even  though  it 
has  a low  information  density,  or  why  it  is  harder  to 
camouflage  a tank  than  a jeep.  It  was  considered  premature 
to  deal  with  these  concepts  here.  In  the  present  study,  retinal 
difficulty  of  seeing  the  targets  was  measured  only  in  terms  of 
image  distance.  Target  area  was  taken  into  account  using 
graphic  analysis  to  reveal  its  effect. 

3.  METHOD 

3.1.  Subjects 

Subjects  were  recruited  by  posters  on  campus  and  consisted 
primarily  of  psychology  majors  in  their  2nd  and  3rd  years  of 
study.  They  were  screened  for  normal  visual  acuity  using  a 
Snellen  chart  and  screened  for  normal  color  vision  using  the 
Dvorinc  Test.  The  purpose  of  the  experiment  was  explained. 
They  were  asked  to  respond  when  they  could  no  longer 
recognize  the  target  in  the  picture  before  them  as  a vehicle 
while  the  picture  moved  away  and  to  respond  when  they  first 
recognized  the  target  as  a vehicle  while  the  picture  moved 
towards  them.  Thev  were  given  several  practice  trials  to  get 
acquainted  with  using  the  controls  and  making  judgements. 
Three  females  and  t«  o males  viewed  slide  projected  images. 
Four  males  and  two  females  viewed  the  images  on  a 
computer  monitor. 

3.2.  Apparatus 

The  subject  was  seated  at  one  end  of  an  8 meter  test  track  in  a 
long,  completely  black,  dark  room.  A carriage  riding  on 
linear-bearings  either  carried  a Kodak  Ektagraphic  35  mm 
slide  projector  with  a 2.5  inch  lens  projecting  155  cm  onto  an 
HP  rear  projection  screen  or  carried  a Dell/Sony  D1025,  17 
inch,  color  monitor  with  1280  X 1024.  0.25  pitch  display.  A 
computer-controlled  stepping-motor  accelerated  and 
decelerated  the  carriage  at  5 cm/.s'  or  maintained  a .steady 
velocity  of  10  cm/s.  Carriage  position  was  continuously 
monitored  by  an  independent  optical-encoder  and  electronic 
register. 

Operational  safety  was  ensured  by  program  interrupts,  limits 
set  in  the  dedicated  motor  controller,  and  an  independent 
system  of  limit  switches  that  operated  a clutch  and  brake  on 
the  chain  drive.  The  computer  also  recorded  the 
measurements,  signalled  when  the  image  should  be  changed, 
and  waited  for  the  subject's  instructions.  Control  buttons 
enabled  the  subject  to  direct  the  computer  to  start  a trial,  read 
the  distance,  or  repeat  the  present  trial.  Images  were  changed 
manually  by  a rc.scarcher  who  was  present  at  all  times. 

3.3.  Images 

A Polaroid  Sprint  Scan  35+  made  35  mm  slides  from  a CD- 
ROM  disk  containing  Toct,  Bijl.  Kooi,  and  Valeton's 
TNO_TM  Search_2  data  set  of  high  resolution  (3072  X 2048 
pixel)  images  of  various  military  vehicles  in  mixed  rural 
landscapes  of  green  foliage  and  pale  yellow  grass.'"  The 
scanner  was  calibrated  in  terms  of  a Kodak  color  calibration 
slide  included  on  the  CD-ROM.  Accuracy  of  color 
reproduction  was  tested  with  a Topcon  BM-7  colorimeter. 
The  Y.  x.  and  y values  for  the  eight  saturated  color  patches 
from  brown  to  blue  correlated  +.60.  +.97,  and  +.77 
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respectively  with  the  Kodak  values.  Generally  smaller  y 
values  presumably  represented  short  wavelength  absorption 
by  the  screen.  Gray  scale  reproduction  correlated  +.92  with 
the  values  on  the  slide. 

In  many  of  the  slide-projected  images,  the  vehicle  was 
difficult  to  discern  even  when  the  image  was  moved  close  to 
the  subject.  In  a few  images,  the  vehicle  could  still  be 
recognized  near  the  far  end  of  the  track.  Eliminating  these 
left  27  slides  that  were  tested.  Non-uniformity  of  brightness 
(mean  = 89  cd/m^,  sd  = 38)  across  the  central  portion  of  rear- 
projected  screen  was  a concern  since  vehicle  position  varied 
considerably.  Viewing  the  CD-ROM  images  directly  on  a 
computer  monitor  produced  a crisper  appearance  overall,  but 
the  images  were  smaller  than  the  projected  images. 

Therefore  these  images  were  enlarged  four  times  with  the 
vehicle  approximately  centered  on  the  screen.  Twenty-eight 
of  the  most  suitable  ones  were  selected  for  testing  with  the 
monitor.  The  Y value  of  the  saturated  color  patches  on  the 
monitor  correlated  +.99  with  the  Kodak  values,  but  the  x and 
y values  could  not  be  measured  for  most  colors.  Gray  scale 
correlation  was  -I-.92  . 

3.4.  Procedure 

The  method  of  limits  was  used  to  measure  recognition 
distance  thresholds  for  the  vehicles  in  the  images.  A within- 
subjects,  ABBA  counterbalanced  design  determined  the  order 
of  image  presentation.  Each  image  was  initially  presented 
close  to  the  subject.  The  location  of  the  target  vehicle  was 
pointed  out  or  verified  with  the  subject.  When  ready,  the 
subject  signalled  the  computer  to  back  away  the  image. 

When  the  subject  could  no  longer  recognize  the  target  as  a 
vehicle,  he/she  signalled  the  computer  to  record  the  distance. 
The  carriage  kept  moving  back  a fixed  plus  random  distance 
and  was  then  brought  to  a halt.  The  procedure  was  then 
reversed  with  the  carriage  moving  forward  until  the  subject 
could  recognize  that  the  target  was  a vehicle. 

Ten  such  measurements  were  taken  in  succession  and  ten 
thresholds  calculated  using  running  averages.  The  two 
thresholds  that  differed  most  from  the  mean  were  dropped 
and  the  mean  and  standard  deviation  of  the  remaining  eight 
were  recorded.  All  images  were  tested  in  single  sessions  that 
lasted  between  90  to  120  minutes.  Subjects  rested  between 
images,  could  take  a longer  break  when  they  wanted,  and 
were  asked  to  rest  a few  minutes  midway  through  a session. 
The  two  orders  of  presentation  were  tested  on  separate  days. 
At  the  end  of  the  last  session,  all  subjects  who  viewed  the 
monitor  pictures  were  asked  to  look  at  each  picture  again  at  a 
distance  about  0.2  meter  and  rate  how  difficult  it  was  to  see 
the  vehicle  using  a ten-point  scale. 

4.  RESULTS 

4.1.  Slide-Projected  Images 

Table  1 at  the  end  of  this  paper  provides  the  mean  threshold 
distances  and  standard  deviations  for  the  subjects  who 
viewed  the  slide  projected  images.  Up  and  down  refer  to  the 
order  in  which  the  images  were  tested.  Results  for  each 
image  were  averaged  across  subjects.  As  explained  in 
Section  2.2,  each  image's  retinal  difficulty  was  calculated  on 
the  basis  of  its  mean  threshold  distance  and  the  threshold 


distance  of  image  #34,  whose  vehicle  could  be  recognized 
farthest  away. 

Figure  1 shows  retinal  difficulty  of  the  27  slide  projected 
images  that  were  tested.  The  images  are  arranged  in 
decreasing  order  of  target  size  based  on  Toet  et  al.'s  report.'^ 
Images  were  arranged  in  order  of  decreasing  target  size  to 
reveal  the  effect  of  target  area.  For  comparison,  Toet  et  al's 
search  time  for  these  targets  is  also  shown.  Not  surprisingly, 
both  retinal  difficulty  and  search  tend  to  increase  as  target 
size  decreases.  Retinal  difficulty  correlated  -0.69  with  target 
size;  search  correlated  -0.43.  The  correlation  of  the  retinal 
and  search  results  was  +0.80.  For  the  larger  targets  on  the 
left,  retinal  difficulty  increases  faster  with  decreasing  size 
than  does  search  time.  Compared  to  search  time,  retinal 
difficulty  was  notably  larger  for  images  number  20,  16,  and 
30.  In  each  of  these  the  vehicle  is  well  outlined  against  its 
background  and  discernable  features.  Image  2 was  noted  for 
taking  longer  to  find  than  might  be  expected  from  its  retinal 
difficulty.  Though  the  target  in  image  2 blends  well  with  the 
background,  the  reader  should  bear  in  mind  that  its  location 
was  indicated  to  subjects  doing  the  distance  viewing. 

For  further  insight  into  how  the  distance  and  time 
measurements  differ,  the  effect  of  target  size  on  difficulty 
was  removed.  The  relative  value  of  the  reciprocal  of  target 
size  was  subtracted  from  relative  values  of  retinal  difficulty 
and  search  time.  The  results  were  then  restored  to  retinal 
difficulty  and  search  time  values.  Figure  2 shows  retinal 
difficulty  and  search  time  results  with  the  image  area  factor 
removed.  Both  functions  have  lost  their  generally  upward 
trend  as  target  size  decreases.  Yet  various  images  such  as 
numbers  16,  20,  30  , and  2 continue  to  differ  substantially  in 
recognition  difficulty  using  either  retinal  or  search 
measurements. 

To  help  identify  where  the  two  sets  differ,  the  images  were 
arranged  in  order  of  increasing  difference  between  the  retinal 
and  search  measurements.  These  results  are  shown  in  Figure 
3.  This  reveals  that  for  most  of  the  images  there  was  marked 
correspondence  of  changes  in  both  retinal  difficulty  and 
search  time,  even  when  the  measurements  differed  somewhat. 
Comparison  of  those  images  that  produced  similar  effects 
with  those  that  differed  revealed  no  systematic  characteristics 
related  to  these  trends. 

Both  the  distance  threshold  and  the  search  time 
measurements  were  tested  for  the  significance  of  the 
differences  between  their  means  using  Duncan's  test.  The 
results  are  shown  in  Figure  4.  The  means  for  distance 
threshold  and  for  search  time  are  arranged  in  decreasing  order 
to  represent  increasing  difficulty  of  recognition  from  left  to 
right.  The  horizontal  lines  below  or  above  the  image 
numbers  indicate  the  images  that  do  not  differ  significantly. 
Generally,  distance  thresholds  effectively  distinguished 
targets  that  are  easy  to  recognize  while  search  time  is  poor  at 
distinguishing  these  same  targets.  To  a considerable  extent 
the  opposite  seems  to  hold  for  images  that  are  difficult  to 
recognize.  Of  the  32  pairs  of  images  whose  search  times  did 
not  differ  significantly,  only  one  pair,  10  and  18,  had  distance 
thresholds  that  did  not  differ  significantly.  Of  the  21  pairs 
whose  threshold  distances  did  not  differ  significantly,  four 
pairs,  10-20,  36-42,  38-32,  and  29-26,  did  not  differ  in  their 
search  times. 


6-4 


IMAGE  NUMBER 


RETNAL  -X-  SEARCH 


Figure  1.  Retinal  difficulty  and  search  time  of  images  that  have  been  arranged  in  order  of  de 
target  size. 


IMAGE  NUMBER 
REnriAL  -X-  SEARCH 


Figure  2.  Retinal  difficulty  and  search  time  with  target  size  effect  removed.  The  images  arranged  in 
order  of  decreasing  target  area. 
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Figure  3.  Images  in  Figure  2 rearranged  to  increasing  difference  between  their  retinal  difficulty  and 
search  time 


Figure  4.  Results  of  Duncan's  test  on  the  significance  of  differences  between  mean  threshold  distances 
and  between  mean  search  times. 
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Figure  5.  Retinal  difficulty  and  mean  search  time  for  recognizing  the  targets  in  the  various  images 
presented  on  a monitor.  Images  are  arranged  in  order  of  decreasing  target  size. 
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Figure  6.  Retinal  difficulty  is  compared  with  the  mean  rating  of  subjective  difficulty  of  seeing  the  vehicle 
in  each  image.  Images  are  arranged  in  order  of  decreasing  target  size. 
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Figure  7.  The  effect  of  size  has  been  removed  from  retinal  difficulty  and  from  search  time.  The  results 
are  plotted  with  the  images  in  ascending  order  of  difference  between  the  two  measurements. 
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Figure  8.  Results  of  Duncan's  test  on  the  significance  of  differences  between  mean  threshold  distances 
and  between  mean  search  times. 
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4.2.  Monitor  images 

The  threshold  distance  data  for  these  images  are  provided  in 
Table  2 at  the  end.  As  was  done  with  the  slide  projected 
images,  the  distance  thresholds  for  each  of  the  28  images 
tested  were  averaged  across  observers  and  converted  to 
retinal  difficulty  values.  The  results  are  plotted  in  Figure  5 
with  the  images  arranged  in  descending  order  of  target  size. 
For  eomparison,  the  search  time  results  measured  by  Toct  et 
al.  for  these  images  are  also  shown.  As  target  size  decreased, 
both  retinal  difficulty  and  search  time  tended  to  increase. 
Retinal  difficulty  correlated  -.61  with  target  size  and  search 
time  -.49  with  target  size.  As  happened  with  the  slide 
images,  the  two  functions  appear  to  track  each  other  closely. 
Except  for  the  smallest  target.  4,  subjects  making  distance 
threshold  judgements  had  more  trouble  with  the  small  targets 
(23,  29,  44)  than  those  for  whom  search  time  was  measured. 
The  target  in  image  4 lacks  both  a distinctive  shape,  due  to  its 
head-on  orientation  and  lack  of  contrast  with  its  background. 
Targets  23,  29  and  44  have  ample  background  contrast  and 
have  cues  to  their  shape  revealed  by  shadow  and  high-lights 
on  the  vehicles.  Nineteen  of  these  images  were  enlargements 
from  the  same  images  tested  as  projected  slides.  The 
correlation  of  the  two  sets  of  measurements  was  +.79. 

For  more  information  about  the  distance  thresholds,  retinal 
difficulty  was  compared  with  the  ratings  of  subjective 
difficulty  that  were  made  by  the  subjects  in  the  present  .study 
who  viewed  the  images  on  a monitor.  Figure  6 shows  retinal 
difficulty  and  the  mean  ratings  of  subjective  difficulty  plotted 
as  a function  of  decreasing  target  area.  The  two  sets  of 
measurements  correlate  +.85.  The  most  notable  differences 
again  involve  small  targets  in  images  number  23  and  44, 
which  had  relatively  larger  retinal  difficulty  values  than 
subjective  ratings.  A similar  difference  was  found  above 
when  the  retinal  difficulty  of  these  targets  was  compared  to 
their  search  times.  On  the  other  hand,  the  largest  target  in 
image  41  and  a medium  sized  target  in  image  28  were  rated 
notably  more  difficult  than  their  retinal  difficulty.  While 
large,  the  target  in  image  41  is  partly  obscured  by  a tree  and 
sits  in  the  vicinity  of  other  complex  outlines  produced  by 
dark  tree  tops  again.st  pale  grass.  The  vehicle  in  image  28 
happens  to  line  up  with  a light  to  dark  transition  across  most 
of  the  scene.  The  subjective  ratings  correlated  less  w'cll  w'ith 
search  time,  r - +.72  than  did  retinal  difficulty,  which  is 
understandable  since  the  latter  involved  the  same  subjects. 

To  see  how  differences  between  retinal  difficulty,  search 
time,  and  difficulty  ratings  were  related  to  the  size  of  the 
targets,  the  effect  of  target  size  was  removed  as  was  done 
with  the  data  for  projected  images.  Figure  7 shows  the 
results  for  retinal  difficulty  and  search  time  with  the  images 
arranged  in  order  of  difference  between  the  two  results.  The 
biggest  differences  between  the  two  types  of  measurement 
occurred  for  the  images  at  each  end  of  the  graph  depending 
on  whether  the  search  time  (on  the  left)  or  retinal  difficulty 
(on  the  right)  had  the  larger  value.  As  was  noted  when  the 
size  effect  was  not  removed,  the  two  types  of  measurement 
still  differ  mo.st  for  images  4,  23,  & 44.  Removing  the  effect 
of  size  substantially  reduced  their  correlation  from  +.72  to 
+.52.  Removing  the  size  effect  did  not  change  the  correlation 
between  retinal  difficulty  and  the  difficulty  ratings. 

Significance  of  the  differenee  between  means  of  distance 
thresholds  of  the  images  presented  on  a computer  monitor 
was  calculated  using  Duncan's  te.st.  The  results  arc  compared 


with  those  for  the  mean  search  times  of  these  images  in 
Figure  8,  Of  the  26  image  pairs  that  did  not  differ 
significantly  in  search  time,  only  3 (42-36,  29-26,  38-32)  did 
not  differ  in  threshold  distance.  Similarly  of  the  21  image 
pairs  that  did  not  differ  in  threshold  di.stance,  3 (10-20,  38-32, 
29-26)  also  did  not  differ  in  threshold  distance.  Distance 
thresholds  were  somewhat  better  at  distingui.shing  the  easier 
images;  search  time  was  better  for  the  more  difficult  images. 

5.  DISCUSSION 

What  do  recognition  distance  thresholds  reveal  about  the 
targets  to  be  recognized?  Target  size  had  a substantial  effect 
on  both  the  distance  and  time  based  measurements  of  visual 
recognition.  This  can  be  seen  in  Figures  1 and  5 by  the 
general  increase  in  retinal  difficulty  and  search  time  with 
decrease  in  target  size.  For  the  slide-projected  images,  retinal 
difficulty  correlated  -.69  with  size,  and  search  time  correlated 
-.46;  for  the  monitor  images,  the  correlations  were  -.61  and  - 
.49  respectively.  The  higher  correlations  of  retinal  difficulty 
with  target  size  arc  understandable  given  the  interaction 
between  size,  distance,  and  retinal  area.  While  statistically 
significant  (for  27  or  28  pairs,  minimum  significant  r = .38) 
these  correlations  leave  nearly  60%  of  the  variance 
undetermined  b\'  the  distance  measurements.  Some  of  that 
variance  was  due  to  measurement  error.  Despite  the  error 
arising  from  intra-  and  inter-observer  variability,  Duncan's 
tests  revealed  that  a majority  of  the  adjacent  distance 
thre.sholds  differed  significantK'.  To  what  extent  did  these 
distinctions  depend  on  target  size  which  interacts  with 
distance  to  determine  the  area  of  the  retinal  image?  This  is 
revealed  in  Figures  2 and  7.  w here  the  effect  of  size  is 
removed  from  these  measurements.  Substantial  variations  in 
both  retinal  difficult)  and  search  time  as  a function  of  the 
various  images,  indicate  that  other  charactcri.stics  of  the 
images  influenced  both  measurement  methods. 

For  additional  insight  into  what  other  characteristics 
influenced  these  judgements,  the  effects  of  contrast  were 
examined.  The  TNO-TM  Scarch_2  data  set  provided 
measurements  of  the  target  vehicles'  luminance  and  the 
surround  luminance.  From  this  a rough  estimate  of  contrast 
was  derived  primarily  based  on  the  ratio  between  the  dark 
area  of  the  vehicle  and  the  usually  lighter  grass.  Retinal 
difficulty  and  search  time  correlated  similarly  with  contrast 
(mean  r=  -.35  and  -.37,  respectively).  Since  these  are  not 
statistically  significant  correlations,  this  analysis  indicates 
that  it  was  not  possible  to  estimate  the  contribution  of 
contrast  to  recognition  from  these  rough  estimates.  Re.scarch 
on  colored  symbols  and  backgrounds  has  found  that 
chromaticity  may  contribute  more  to  distance  thre.sholds  than 
docs  lightness  contrast.'^  Chromaticity  data  were  not 
available  here,  but  their  availahilit)'  should  be  considered  in 
future  research. 

Finally  there  is  the  effect  of  target  shape.  The  vehicles 
generally  appeared  to  have  the  mo.st  distinct  shape  when  they 
stood  sidew'ays  to  the  vicw'cr.  A rough  estimate  of  the  effect 
of  shape  was  derived  using  the  absolute  value  of  a sine 
transformation  of  the  vehicle's  aspect  angle  in  each  image. 
For  the  slide-projcctcd  images,  retinal  difficulty  and  search 
time  correlated  -.39  and  -.44  with  this  estimate  of  the  shape 
effect;  correlations  for  the  monitor  images  were  not 
significant  (r  = -.25  and  -.28  respectively).  The  lower 
correlations  of  both  retinal  difficulty  and  search  time  for  the 
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images  viewed  on  the  monitor  were  attributed  to  the  different 
images  in  the  two  sets,  since  viewing  conditions  were 
constant  for  the  search  time  data.  The  barely  significant 
correlations  with  shape  of  data  from  the  slide-projected 
images  indicates  that  some  of  the  differences  in  recognition 
were  due  to  shape,  but  that  aspect  angle  is  not  adequate  for 
describing  the  effect  of  shape  on  these  measurements. 

How  did  the  retinal  difficulty  measurements  compare  with 
the  search  time  measurements?  Evidence  that  the  two 
methods  were  differently  affected  by  target  size  is  revealed  in 
the  results  of  the  Duncan's  tests  in  Figures  4 and  8.  For  both 
sets  of  images,  threshold  distances  tended  to  distinguish 
larger  targets  better  than  smaller  ones,  while  search  times 
tended  to  distinguish  smaller  targets  better  than  larger  ones. 
Nevertheless,  Figures  1 and  4 and  show  that  both 
measurements  of  recognition  difficulty  generally  increased 
with  decreasing  target  size.  This  was  also  indicated  by  their 
significant,  negative  correlations  with  size  as  discussed 
above.  However,  many  images  were  exceptions  to  this  trend 
- some  more  so  for  retinal  difficulty,  others  for  search  time. 
The  two  methods  of  measurement  correlated  -I-.80  for  the 
projected  images  and  +.12  for  the  monitor  images.  Since 
both  measurements  correlated  substantially  with  image  size, 
perhaps  this  explains  their  similarity.  Removing  the  size 
effect  reduced  these  correlations  to  +.6\  and  -f-,52 
respectively.  Note  that  the  reduction  was  nearly  the  same  for 
both  sets  of  images.  This  similarity  indicates  that  target  size 
had  a similar  effect  on  both  measurements.  With  the  size 
effect  removed,  the  results  were  arranged  in  order  of  the 
amount  of  difference  between  the  two  sets  of  measurements 
in  Figures  3 and  7.  In  both  figures,  retinal  difficulty  and 
search  time  vary  considerably  as  the  images  change.  This 
indicates  that  differences  other  than  the  size  of  the  images 
affected  both  methods  of  measurement.  The  close  tracking  of 
the  two  sets  of  results  in  both  figures  might  lead  one  to  think 
that  two  methods  were  similarly  affected  by  these  other 
characteristics.  This  was  not  the  case.  Reorganizing  the 
results  in  terms  of  the  differences  does  not  change  their 
correlations.  Despite  appearances,  the  pairs  of  functions  in 
Figures  3 and  7 still  have  r values  of  only  -I-.61  and  +.52. 

Since  Duncan's  tests  indicated  a majority  of  the 
measurements  were  significantly  different,  the  remaining 
variance  was  not  all  due  to  error.  Evidently  retinal  difficulty 
and  search  time  differed  in  how  they  were  affected  by  other 
characteristics  of  the  images. 

Which  method  is  better  for  distinguishing  the  difficulty  of 
recognizing  the  vehicles?  Duncan's  tests  indicate  that 
threshold  distances  distinguished  more  images  than  did 
search  time.  For  another  way  to  address  this  question, 
subjects  were  asked  to  rate  the  overall  difficulty  of  seeing  the 
vehicle  in  each  of  the  monitor-presented  images.  Figure  6 
shows  that  the  ratings  increased  more  steadily  with  target  size 
than  did  retinal  difficulty.  A comparison  with  Figure  5 
shows  this  was  also  the  case  with  respect  to  search  time. 
Retinal  difficulty  correlated  more  strongly  with  the  difficulty 
ratings  ( r = +.85)  than  did  the  search  times  {r  = +.72). 
However,  this  is  hardly  a fair  comparison  since  the  search 
times  were  obtained  from  different  subjects.  Researchers 
studying  the  effectiveness  of  camouflage  or  other  graphics 
should  consider  adding  this  simple  measurement  for 
comparison  purposes. 


Overall,  the  results  obtained  by  Toet  et  al.  and  the  present 
study  indicate  that  the  search  time  and  retinal  difficulty 
probably  reflect  different  characteristics  of  the  images.  What 
does  this  imply  for  measuring  the  difficulty  of  recognizing 
camouflaged  vehicles  or,  more  generally,  for  measuring  the 
effectiveness  of  graphics?  Look  again  at  the  results  from 
Duncan's  tests  in  Figures  4 and  8.  On  average  only  7%  of  the 
images  that  could  not  be  distinguished  by  search  time  were 
also  not  distinguished  by  threshold  distance,  and  17%  of  the 
images  not  distinguished  by  distance  were  also  not 
distinguished  by  search  time.  This  suggests  that  the  two 
measurements  are  indeed  complementary.  The  use  of  both 
methods  together  would  improve  distinguishing  camouflaged 
vehicles  in  terms  of  recognition  difficulty.  From  a broader 
perspective,  the  favorable  comparison  with  search  time  data 
on  a standard  set  of  images,  provides  further  evidence  that 
distance  threshold  measurements  are  an  effective  means  of 
measuring  the  effectiveness  of  complex  graphic  displays. 
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